Molecular Phylogenetics and Evolution
○ Elsevier BV
Preprints posted in the last 30 days, ranked by how well they match Molecular Phylogenetics and Evolution's content profile, based on 61 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.
Smith, M. L.; Moshier, S.; Shoobs, N. F.
Show abstract
The temperate rainforests of the Pacific Northwest of North America harbor many endemic taxa whose evolutionary histories have been shaped by major climatic and geologic events. The enigmatic taildropper slugs (genus Prophysaon) are one example, notable for their ability to autonomize their tails to escape predators. Despite extensive work uncovering the evolutionary history of individual lineages, relationships among the nine recognized species of Prophysaon remain poorly understood due to insufficient molecular data. To address this, we collected transcriptomes for six of the nine currently accepted species of Prophysaon. Using these data, we were able to resolve species relationships, calling into question the existing subgeneric classification based on morphology. We also detected undescribed phenotypic diversity within the P. andersonii--P. foliolatum species complex, with molecular data supporting the distinctness of two phenotypically distinct populations from Washington. Finally, our transcriptomic data suggest a moderate role of introgression in shaping the evolutionary history of Prophysaon. Here, we synonymize the subgenus Mimetarion with nominotypical Prophysaon. Future work should further investigate whether the undescribed diversity detected here represents species level differentiation.
Pacheco, M. A.; Schaer, J.; Werb, O.; Mello, B.; Escalante, A. A.
Show abstract
Haemosporida is a diverse order of vector-borne apicomplexan parasites infecting terrestrial vertebrates worldwide, including humans, but the evolutionary relationships among its genera remain unresolved. The phylogenetic placement of two bat-restricted genera, Nycteria and Polychromophilus, both of which lack erythrocytic schizogony, has varied across studies depending on taxon sampling and marker choice. To address this problem, an expanded dataset of near-complete mitochondrial (mtDNA) genomes together with nine nuclear loci were analyzed. Phylogenetic analyses of mtDNA recovered Nycteria and Polychromophilus as a strongly supported monophyletic clade. In contrast, analyses based only on the three mitochondrial coding genes (CDS) or a reduced nuclear dataset failed to recover their monophyly and showed low support and extensive topological conflict at deeper nodes. These results indicate that near-complete mitochondrial genomes recover phylogenetic signal that is not captured by reduced mitochondrial coding sequences or partial nuclear datasets. Molecular dating analyses further showed that divergence estimates for a putative Nycteria-Polychromophilus clade are compatible with the proposed times for bats diversification, and consistent with the broader haemosporidian timescale. When the Nycteria-Polychromophilus clade was incorporated as a calibration prior, divergence-time estimates became more precise without altering the overall evolutionary timeframe. Substantial mitochondrial gene-order rearrangements in a distinct Nycteria lineage were confirmed, highlighting structural divergence within this bat-associated group. In addition, heterogeneity in rates across mtDNA haemosporidian lineages was observed. Together, these findings support the existence of a distinct bat-associated clade whose deeper placement and evolutionary significance should be tested with broader phylogenomic sampling. Author SummaryMalaria parasites belong to a diverse group of organisms that infect many kinds of vertebrates, including birds, reptiles, and mammals (such as humans). Understanding how these parasites are related to each other is important for explaining how key biological traits have evolved. However, the relationships among major groups of haemosporidian parasites, including malaria parasites, remain unclear, particularly for those infecting bats. In this study, we focused on two groups of bat parasites, Nycteria and Polychromophilus, which share unusual biological features. The inferred evolutionary relationships of these two genera to other haemosporidians have been inconsistent across previous studies. By analyzing near-complete mitochondrial genomes, we found strong evidence that these two groups descended from a common evolutionary ancestor. In contrast, smaller datasets including nuclear genes failed to recover this relationship and produced conflicting results, suggesting that they lack sufficient information to resolve deep evolutionary relationships. We also found that this bat-associated lineage likely originated around the same time as early bats. In addition, we identified structural changes in the mitochondrial genome of one lineage, highlighting its evolutionary distinctiveness. Together, our results suggest that bats host a unique group of malaria parasites and demonstrate that more complete genetic data are essential for resolving their evolutionary history.
Nanjala, C.; Simpson, L.; Hu, A.-Q.; Patel, V.; Nicholls, J. A.; Bent, S. J.; Gale, S. W.; Fischer, G. A.; Goedderz, S.; Schuiteman, A.; Crayn, D.; Clements, M. A.; Nargar, K.
Show abstract
Understanding evolutionary relationships in hyperdiverse plant groups remains a major challenge in systematics. The orchid genus Bulbophyllum, the second largest genus of flowering plants, represents an exceptional example of phylogenetic and morphological complexity. Relationships, particularly within the species-rich Asian clade, have remained poorly resolved due to extensive morphological variation and limited resolution in previous phylogenetic studies. Here, we reconstructed phylogenetic relationships using 63 plastid genes from 355 specimens representing 322 species and 65 of the 97 recognised sections of Bulbophyllum. Our analyses confirmed that the genus comprises five major evolutionary lineages comprised of species predominantly from Australasia, Madagascar, Continental Africa, Neotropics, and Asia. We provide the first robust phylogenetic evidence for a dichotomous split within the Asian clade into two well-supported lineages: the Asian-Malesian clade and the Malesian-Papuasian clade, with the latter containing a strongly supported Papuasian subclade. Additionally, this study supports the monophyly of several currently recognised sections while clarifying relationships in previously problematic groups. This study provides the most comprehensive plastid-based phylogenomic framework for Bulbophyllum to date and establishes a foundation for future taxonomic revision and integrative analyses of diversification and trait evolution within this hyperdiverse genus.
Leroy, R. B.; Eme, L.; Lopez-Garcia, P.; Moreira, D.
Show abstract
Understanding the phylogenetic relationships among eukaryotic lineages is essential for tracing the evolution of key phenotypic traits and inferring the nature of the Last Eukaryotic Common Ancestor. While phylogenomic analyses have clustered eukaryotic taxa into several well-supported major supergroups, the relationships among them remain largely uncertain. Phylogenetic signal erosion over deep time and limited available taxon sampling are among the possible causes. However, most previous studies rely on variations of the same core protein dataset, hence containing the same potential systematic biases. Here, we reconstructed the eukaryotic Tree of Life using a largely independent, marker-rich dataset derived from highly conserved Benchmarking Universal Single-Copy Orthologs. Unlike previous collections, our 277-marker supermatrix minimizes ribosomal protein representation and shares less than 25% overlap with previous datasets. State-of-the-art analyses of this dataset confirm most eukaryotic supergroups previously observed, but suggest different positions for some lineages. Notably, Telonemia clusters with Haptophyta rather than SAR (Stramenopiles-Alveolata-Rhizaria), and Ancyromonadida and Malawimonadida form a monophyletic group at the base of the Opimoda. Our results highlight the importance of analyzing independent phylogenomic datasets and support the hypothesis that extant eukaryotic diversity encompasses a small number of high-rank, supergroup lineages.
Nagel, A. A.; Landis, M. J.
Show abstract
Ancestral state reconstruction is a classical problem of broad relevance in phylogenetics. Likelihood-based methods for reconstructing ancestral states under discrete character models, such as Markov models, have proven extremely useful, but only work so long as the assumed model yields a tractable likelihood function. Unfortunately, extending a simple but tractable phylogenetic model to possess new, but biologically realistic, properties often results in an intractable likelihood, preventing its use in standard modeling tasks, including ancestral state reconstruction. The rapid advancement of deep learning offers a potential alternative to likelihood-based inference of ancestral states, particularly for models with intractable likelihoods. In this study, we modify the phylogenetic deep learning software O_SCPLOWPHYDDLEC_SCPLOW to conduct ancestral state reconstruction. We evaluate O_SCPLOWPHYDDLEC_SCPLOWs performance under various methodological and modeling conditions, while comparing to Bayesian inference when possible. For simple models and small trees, its performance resembles the performance of Bayesian inference, but worsens as tree size increases. While O_SCPLOWPHYDDLEC_SCPLOW still performs adequately for more complex models, such as speciation and extinction models, the estimates differ more from Bayesian inference in comparison with simpler models. Lastly, we use O_SCPLOWPHYDDLEC_SCPLOW to infer ancestral states for two empirical datasets, one of the ancestral ranges of a subclade of the genus Liolaemus and ancestral locations for sequences from the 2014 Sierra Leone Ebola virus disease outbreak.
Leache, A.; Davis, H.; Guerra, E.; Herrera, A.; Lemos-Espinal, J.; Fujita, M.; Myers, T. C.; Singhal, S.
Show abstract
Species delimitation is a fundamental challenge in systematic biology, particularly for geographically variable taxa with hierarchical population structure and gene flow. Migration-aware coalescent models provide a powerful framework for investigating lineage divergence and accurately defining species boundaries. In this study, we combine statistical evaluations of gene flow with phylogenetic and population structure analyses to delimit species of fence lizards within the Sceloporus undulatus complex, a group characterized by extensive population subdivision, mitochondrial DNA introgression, and nuclear gene flow. We find that the undulatus complex exhibits uneven variation in genetic, morphological, and bioclimatic traits, resulting in variable distinctiveness among groups. In some cases, species boundaries are recognized by clear genetic discontinuities without gene flow. In others, shallow divergence, paraphyly, and gene flow produce leaky boundaries and fuzzy species limits. Mitochondrial introgression is extensive and concentrated at species boundaries, whereas nuclear gene flow occurs between only a few species and at much lower levels than within species. Neither within-species populations or species are substantially diverged across morphology or bioclimatic space, highlighting the limited utility of these traits for diagnosing species in this group. By integrating estimates of gene flow with phylogenetic and population structure analyses, this study provides a robust and biologically meaningful revised taxonomic framework for the undulatus complex that identifies independently evolving lineages as species.
Whittall, J. B.; Zhang, M.; Guiton, P. S.
Show abstract
In Toxoplasma gondii, microneme proteins (MICs) are secreted components of the apical complex that play central roles in motility, host cell attachment, and invasion. Because proteins at the host-parasite interface are often predicted to evolve rapidly, MICs have been suggested as candidates for adaptive diversification. We tested this expectation using comparative analyses of three relatively understudied microneme proteins, MIC13, MIC12, and MIC16. Coding sequences were assembled from GenBank and ToxoDB, aligned by translation, and analyzed using maximum-likelihood phylogenetics, codon-based tests of selection, and predicted protein structure. MIC13 was represented by 51 sequences, MIC12 by 30, and MIC16 by 34, spanning multiple T. gondii haplogroups and including Hammondia hammondi and Neospora caninum as outgroups. All three genes were highly conserved among T. gondii strains, but their phylogenetic trees were topologically incongruent, indicating that individual MICs do not recover a single shared strain history. Contrary to expectation, no positively selected codons were detected in any gene. Instead, purifying selection was detected at one site in MIC13 and 15 sites in MIC12, while no significant codon-specific selection was detected in MIC16. Several constrained MIC12 sites overlapped annotated EGF and calcium-binding EGF-like domains, consistent with structural conservation of extracellular adhesion modules. AlphaFold prediction of MIC13 supported two sialic acid-binding micronemal adhesive repeat regions, but the single constrained MIC13 site did not overlap these motifs. Together, these results indicate that MIC13, MIC12, and MIC16 are shaped more by sequence conservation and heterogeneous gene histories than by strong recurrent positive selection. These findings refine expectations for microneme evolution in T. gondii and highlight conserved domains that may be important for parasite invasion and future functional study.
Montoliu-Nerin, M.; Strunov, A.; Heyworth, E.; Schneider, D. I.; Thoma, J.; Hua-Van, A.; Courret, C.; Klasson, L. J.; Miller, W. J.
Show abstract
BackgroundAlthough strict maternal transmission of mitochondria is a general feature of animals and humans for ensuring homogeneity in mitochondrial DNA (mtDNA) across generations, exceptions were reported in the recent past. For example, some extremely rare but spectacular cases of heteroplasmy and paternal transmission in humans have questioned the universal evolutionary principle. Hence, as an alternative, the Mega-NUMT concept was coined to explain this discovery and was thereafter partly proven to exist. This concept expands on the quite common transfer of mtDNA fragments to the nucleus (NUMTs) by considering the existence of multicopy mitochondrial nuclear insertions. Mega-NUMT reports are currently restricted to a few cases in animals, including humans. However, even in humans, their detailed genomic organization, natural prevalence, and potential biological functions remain unclear. Methodology/Principal FindingsHere, we discovered that up to 60 full-sized mitochondrial genomes are integrated into the nuclear genome of the neotropical fruit fly Drosophila paulistorum using long-read sequencing and confirmed their presence by in situ hybridization. The copies are organized in one cluster on chromosome 3, which we, due to its similarity with the Mega-NUMT concept, designated the "Dpau Mega-NUMT". Contrary to the rarity in humans, this Mega-NUMT is found at high prevalence (40%) in both long-term laboratory lines and natural D. paulistorum populations of different semispecies. Additionally, the mitochondrial copies in the Mega-NUMT cluster are phylogenetically separated from the current mitotypes of D. paulistorum. Together, these observations suggest long-term maintenance of the Mega-NUMT in nature. Hence, we propose that the Dpau Mega-NUMT may have been transferred to the nuclear genome before D. paulistorum semispecies radiation and maintained at relatively high prevalence in nature by balancing selection due to yet undetermined functions. Conclusions/SignificanceTo our knowledge, this is the first verified existence and detailed dissection of a Mega-NUMT outside cats and humans. We show that Mega-NUMTs can be persistent in nature, even at high prevalence, potentially due to balancing selection. Our findings strengthen the importance of high-quality long-read sequencing technologies for deciphering complex repeat-rich genomic regions to deepen our understanding of the dynamics of genome evolution within genomic "dark matter".
Milkey, A.; Chen, J.; Lewis, P. O.
Show abstract
AO_SCPLOWBSTRACTC_SCPLOWAs modern phylogenomics datasets become increasingly large, it is useful to develop recommendations for how to subsample datasets for best species tree inference. Here we apply a new measure of phylogenetic information content that estimates the reduction in tree space occupied by a posterior sample of inferred trees relative to a prior sample in order to assess the effects of gene tree parameters on species tree estimation. We find that, consistent with earlier studies, when data are informative, more data result in better species tree inference. However, when data are uninformative, subsampling a dataset to include only the most informative loci may produce a better species tree sample. We perform analyses on a variety of simulated and empirical datasets.
Boyane, S. S.; Behrends, G. J.; Manthey, J. D.
Show abstract
Codiversification often arises when hosts and their endosymbionts share a linked evolutionary history, exhibit vertical transmission, or share ecological and biogeographic processes. Most studies on the codiversification of carpenter ants (genus Camponotus) have focused on the co-phylogeny of hosts and endosymbionts across multiple species; however, no studies have examined the intraspecific population-level phylogeographic patterns of codiversification within Camponotus. California has been a geographic focus for phylogeographic studies due to its high endemism and complex geographic structure, and Camponotus laevigatus is a carpenter ant primarily found there. Here, we used whole-genome sequencing from C. laevigatus and its endosymbiont, Blochmaniella to investigate phylogeographic patterns of host-endosymbiont codiversification and estimated kinship of ants sampled near one another. We identified three phylogeographic clusters and isolation-by-distance analyses indicated a positive relationship between genetic and geographic distance in C. laevigatus and Blochmaniella. Using estimates of effective migration surfaces, we found that the Central Valley in California acts as a significant barrier to gene flow among populations. Our phylogenetic analyses revealed the congruent phylogenies of C. laevigatus and Blochmaniella, supporting codiversification. We also estimated kinship among individuals from the same and nearby sampling sites; kinship results indicated full-sister relationships among individuals from the same sampling site, except for three pairwise comparisons, and foragers from nearby sampling sites displayed some shared kinship. Lastly, our demographic analysis revealed a Pleistocene divergence, highlighting the role of Quaternary climatic cycles in shaping the population structure of C. laevigatus.
Rodriguez-Rojas, P. C.; Oceguera-Figueroa, A. F.; Navarro-Siguenza, A. G.; Vazquez Miranda, H.
Show abstract
Text AbstractIn this study, we characterized the genetic structure and reconstructed the demographic history of cactus wrens (Campylorhynchus brunneicapillus), an endemic species of desert regions of North America, that shows a clear phenotypic and genotypic variation. We evaluated the effects of historical climate change on the structure and population dynamics of desert species using genomic data through genotyping by sequencing (GBS) and applied a population structure analysis (FST and ADMIXTURE), revealing two genetically differentiated groups: one continental and another peninsular in Baja California. Subsequently, we implemented the MSMC2 coalescent model on data divided into autosomal regions and the Z sex chromosome to estimate changes in effective population size (Ne) through evolutionary time. Additionally, we developed ecological niche models (ENMs) projected to the Last Glacial Maximum (LGM), Last Interglacial (LIG), Present times, and Future (2060 - 2080). Results indicate that both populations maintained moderated Nes before the LGM, experienced severe bottlenecks (Ne [~] 102-103), followed by a sustained expansion. However, recovery was limited to the Z chromosome of the peninsular population. These findings reveal how glaciations and interglacials shaped the evolutionary history of desert species and provide genomic evidence of the splitting of C. affinis from C. brunneicapillus. Article summaryThis research examines how climate changes shaped genetic diversity of cactus wrens across North American warm deserts. Using coalescent methods, researchers tracked effective population size changes over 100,000 years, using ecological niche modeling they predicted habitat suitability across climate periods. Results showed that continental and peninsular populations experienced bottlenecks during the Last Glacial Maximum, followed by demographic recovery on warm periods. However, the sex chromosome (Z) revealed male-biased demographic patterns in peninsular populations. Future projections indicated habitat suitability reductions for peninsular populations, highlighting conservation concerns. These findings demonstrate that past climate shaped genetic diversity of cactus wrens.
Seckin, E.; Colinet, D.; Bailly-Bechet, M.; Seassau, A.; Bottini, S.; Sarti, E.; Danchin, E. G.
Show abstract
Orphan genes, lacking homologs in other species, are systematically found across genomes. Their presence may result from extensive divergence from pre-existing genes or from de novo gene birth, which occurs when a gene emerges from a previously non-genic region. In this study, we identified orphan genes in the genomes of globally distributed plant-parasitic nematodes of the genus Meloidogyne and investigated their origins, evolution, and characteristics. Using a comparative genomics framework across 85 nematode species, we found that 18% of Meloidogyne genes are genus-specific, transcriptionally supported orphans. By combining ancestral sequence reconstruction and synteny-based approaches, we inferred that 20% of these orphan genes originated through high divergence, while 18% likely emerged de novo. Proteomic and translatomic evidence confirmed the translation of a subset of these genes, and feature analyses revealed distinctive molecular signatures, including shorter length, signal peptide enrichment, and a tendency for extracellular localization. These findings highlight orphan genes as a substantial and previously underexplored component of the Meloidogyne genome, with potential roles in their worldwide parasitism.
Dervaux, J.; Brunet, P.
Show abstract
The growth of cultures and formation of mucilage blooms in reaction to salt stress of cyanobacterial cultures are investigated with a focus on the influence of pH. In non-buffered medium, cultures show their pH increasing from 6.5 just after inoculation, up to 11 during the exponential phase. We record the time-evolution of concentration and pH, with different initial OD0. In a second set of experiments, we extract the doubling time of the unbuffered cultures in comparison with those inoculated in pH-buffered BG11 media at four different pH from 6.3 to 10.5 : in the most acid media, all cultures die or grow very slowly. At pH = 10.5, we obtain the fastest growth for all four strains, allowing to qualify these cyanobacteria as being alkaliphiles, though for all strains with comparable initial OD0, the doubling time is shorter for unbuffered cultures. Following a previous study [31]), we finally investigate the influence of pH on mucilage formation and biomass uplift induced by salt stress, involving EPS floculation by cations. Our results show that operating in buffered media significantly influences the mucilage formation, though the observed regimes cannot be simply correlated to the pH value.
van den Burg, M. P.; Thibaudier, J.
Show abstract
Understanding behavioral differences between non-native and closely related endangered species could be important to aid conservation management. In volume 169 of Zoology, Bels et al. (2025) reported on their comparison of display-action-patterns (DAP) between native Iguana delicatissima and non-native iguanas present on islands of the Guadeloupe Archipelago in the Caribbean Lesser Antilles. Here, we address conceptual and methodological concerns about their work and reanalyze their data given our proposed corrections, primarily a literature-informed adjustment of their "species" category. We additionally utilize online videos from South American mainland I. iguana populations, from where the non-native iguanas in the Guadeloupe Archipelago originate, to better understand the different DAPs between native and non-native iguanas in the Guadeloupe Archipelago. Significant differences in DAP characteristics among "species" categories (native I. delicatissima, non-native iguanas, and hybrids) show that Bels et al. (2025) oversimplified their data analyses by merging all non-native populations into one group. This result indicates the presence of behavioral variation among subpopulations within widely hybridizing iguanid populations, which has been poorly studied. Additionally, videos from mainland populations across two major mitochondrial clades of Iguana iguana show that non-native iguanas on Guadeloupe retained DAP characteristics of those populations from which they originate. We discuss these findings in light of the proposed hypotheses put forward by Bels et al. (2025), of which two can be excluded. Overall, our reanalysis shows that studies focusing on characteristics within settings of complex hybridization in diverse species should acknowledge this complexity.
Dittami, S. M.; Hudson, J.; Brillet-Gueguen, L.; Ficko-Blean, E.; Tanguy, G.; Rousvoal, S.; Legeay, E.; Markov, G. V.; Delage, L.; Godfroy, O.; Corre, E.; Collen, J.; Leblanc, C.; Egan, S.
Show abstract
BackgroundRed macroalgae (Rhodophyta) are ecologically and economically important marine primary producers, yet genomic resources for most species remain scarce. Delisea pulchra, a temperate red alga known for its halogenated furanone-based chemical defenses, serves as a model for studying algal-microbe interactions, antifouling mechanisms, and disease dynamics. ResultsHere we present a high-quality genome assembly of this species. The nuclear genome comprises 134 Mbp across 271 contigs with an N50 of 1.47 Mbp and encodes 13,387 predicted protein-coding genes. Comparative genomics with other red algae revealed expansions in gene families involved in DNA methylation, and oxidative stress responses, including glutathione S-transferases and superoxide dismutases. Analysis of glycosyltransferases, sulfatases, and sulfurylases implicated in galactan biosynthesis suggests D. pulchra possesses a complex and potentially novel extracellular matrix. We also identified several vanadium haloperoxidases (vHPOs), heme-dependent haloperoxidases (hHPOs), and two type III polyketide synthase (PKS) genes unique to the D. pulchra, which together represent promising candidate genes for bromofuranone production. ConclusionThe D. pulchra genome provides a foundation for molecular investigations into defense, signaling, and host-microbe interactions. It has been deposited at the European Nucleotide Archive under accession number PRJEB101077. All datasets, annotations, and interactive tools for exploring the genome are also available through the Rhodoexplorer portal at https://rhodoexplorer.sb-roscoff.fr.
Feigin, C. Y.; Trybulec, E.; Ferguson, R.; Scicluna, E. L.; Sauermann, R.; Hartley, G. A.; O'Neill, R. J.; Pask, A. J.
Show abstract
Small marsupials in the family Dasyuridae are a key component of Australias arid and semi-arid fauna, whose high species richness is proposed to reflect an opportunity-driven adaptive radiation. Despite growing interest in this group from both ecological and evolutionary perspectives, genomic data for most species is non-existent, or limited to a few marker loci. Here, we generated a chromosome-level reference genome and a de novo mitochondrial genome for the desert-dwelling Wongai ningaui (Ningaui ridei). The nuclear genome assembly is highly contiguous, with a scaffold N50 of 594.484 MB and high BUSCO gene recovery (93.84%). Additionally, we produced a draft assembly for the related, semi-arid slender-tailed dunnart (Sminthopsis murina). We then used these assemblies to explore the demographic histories of these species. We find evidence for contrasting patterns of population growth during the late Pleistocene and early Holocene, corresponding with differences in local climate, potentially consistent with differences in optimal habitat. The new genomic resources and demographic findings presented here provide a foundation for future studies on adaptive specialisation in this group of Australian marsupials. Significance StatementDasyurid marsupials are the primary carnivorous and insectivorous mammals in Australia. This diverse family includes species such as the endangered Tasmanian devil (Sarcophilus harrisii) and quolls (Genus Dasyurus), as well as an emerging laboratory model species, the fat-tailed dunnart (Sminthopsis crassicaudata). Despite the species richness within dasyurids, most species remain under-studied. This is particularly true of arid and semi-arid zone species, who are often small in size, live in remote habitats and are cryptic by nature. By creating genome assemblies for two dasyurid species, this study provides resources to support a variety of phylogenetic, population genetic and evolutionary developmental lines of research. Importantly, the studys finding that arid and semi-arid dasyurids show distinct trajectories of demographic change in response to historical climatic shifts may point to local adaptations with implications for the resilience of these species to ongoing and future climate change.
Koshkarov, A.; Tahiri, N.
Show abstract
Phylogenetic trees represent the evolutionary histories of taxa and support tasks such as clustering and Tree of Life reconstruction. Many established comparison methods, including the Robinson-Foulds (RF) distance, assume identical taxon sets. A methodological gap remains for trees with distinct but overlapping taxa. Existing approaches either prune non-common leaves, which can discard information, or complete both trees such that they share the same taxa. Completion is more comprehensive, but current methods typically ignore branch lengths, which are essential for identifying evolutionary patterns. This paper introduces k-Nearest Common Leaves (k-NCL), an algorithm for completing rooted phylogenetic trees defined on different but overlapping taxa. The method uses branch lengths and topological characteristics and does not rely on a specific distance measure. The k-NCL algorithm is designed to preserve evolutionary relationships in the trees under comparison. The running time is O(n2), where n is the size of the union of the two leaf sets. Additional properties include preservation of original distances and topology, symmetry, and uniqueness of the completion. Implemented in Python, k-NCL is evaluated on biological datasets of amphibians, birds, mammals, and sharks. Experimental results show that RF combined with k-NCL improves phylogenetic tree clustering performance compared to the RF(+) tree completion approach. Availability and implementationAn open-source implementation of k-NCL in Python and the datasets used in this study are available at https://github.com/tahiri-lab/KNCL.
Soares, L. S.; Fagundes, N. R.; Bombarely, A.; Freitas, L. B.
Show abstract
The remarkable diversity of life on Earth results from evolutionary processes functioning across different spatial and temporal scales. Species diversification occurs through various mechanisms and at widely varying rates, but identifying the conditions that trigger bursts of diversification over short timescales remains a central challenge in evolutionary biology. This difficulty is more pronounced when incomplete lineage sorting (ILS), hybridization, and ongoing gene flow obscure evolutionary relationships and complicate species delimitation. In this study, we investigated the evolutionary history and species boundaries within a group of recently diverged Petunia lineages shaped by pervasive gene flow. We integrated phylogenomic, population genetic, and species delimitation approaches to reconstruct lineage relationships and assess whether these lineages represent distinct species or stages along a speciation continuum. By applying methods that account for both ILS and gene flow, we revealed that most lineages are not fully independent evolutionary units but rather occupy intermediate positions along this continuum. Gene flow played a crucial role during diversification, blurring species boundaries and generating reticulate evolutionary patterns. Our findings demonstrate that traditional phylogenetic trees may oversimplify relationships in such systems, while phylogenetic networks offer a more accurate representation of evolutionary history. Comprehensive and integrative analyses, such as those employed here, are essential for capturing these complex dynamics. Ultimately, only four lineages could be confidently recognized as distinct species, whereas the remaining represent cases of ongoing divergence. These results emphasize the need to refine species delimitation frameworks for systems characterized by recent divergence and extensive reticulation.
Leon, A.; Henriet, S.; Lagman, D.; Martin, S. B.; Canal, A.; Alleon, G.; Lenfant, C.; Aasjord, A. E.; Chourrout, D.
Show abstract
In tunicates, larvaceans represent a fascinating case of evolution, where the chordate body plan has been maintained despite a rapidly evolving genome characterized by strong In contrast to other tunicates, larvaceans keep the chordate body plan during their entire life. They have acquired a highly specialized epithelium in charge of producing the "house", a complex extracellular apparatus used for filter feeding in the plankton. To what extent the house and this epithelium represent true molecular innovations withing chordates is a question for which thorough transcriptomics can bring novel insights. We conducted a developmental profiling of gene expression at the single-cell level in the larvacean Oikopleura dioica. We provide detailed descriptions of cellular transcriptomes associated with the house-synthesizing organ, which permits to define the molecular specifics of epithelial cell territories. We followed their emergence during development, and we identified genes that represent key candidate molecules for regulating the morphogenesis of the house-producing organ. Dynamic changes in gene expression and cell identities during major developmental transitions of the lifecycle illustrate that our dataset effectively allows access to the diversity of O. dioicas cell types in embryos and in adults. The resources presented here constitute critical assets to investigate larvacean biology and evolution for mechanistic and comparative goals.
Milkey, A.; Lewis, P. O.
Show abstract
AO_SCPLOWBSTRACTC_SCPLOWA new Bayesian measure of phylogenetic information content is introduced based on geodesic distances in treespace. The measure is based on the relative variance of phylogenetic trees sampled from the posterior distribution compared to the prior distribution. This ratio is expected to equal 1 if there is no information in the data about phylogeny and 0 if there is complete information. Trees can be scaled to have the same mean tree length to avoid dominance by edge length information and focus on topological information. The method scales well, requiring only that a valid sample can be obtained from both prior and posterior distributions. We show how dissonance (information conflict) among data sets can also be estimated. Both simulated and empirical examples are provided to illustrate that the new approach produces sensible and intuitive results.